Top-k Query Evaluation for Schema-Based Peer-to-Peer Networks
Abstract
Increasing the number of peers in a peer-to-peer network usually increases the number of answers to a given query as well. While having more answers is nice in principle, users are not interested in arbitrarily large and unordered answer sets, but rather in a small set of "best" answers. Inspired by the success of ranking algorithms in Web search engines and top-k query evaluation algorithms in databases, we propose a decentralized top-k query evaluation algorithm for peer-to-peer networks which makes use of local rankings, rank merging and optimized routing based on peer ranks, and minimizes both answer set size and network traffic among peers. As our algorithm is based on dynamically collected query statistics only, no continuous index update processes are necessary, allowing it to scale easily to large numbers of peers.
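To make the idea of local rankings and rank merging more concrete, the following is a minimal Python sketch and not the algorithm from the paper. It assumes that every peer scores its own answers on a scale comparable across peers, which the abstract does not state. The names `local_top_k`, `merge_top_k`, the peer names and the document scores are all hypothetical and chosen for illustration only.

```python
import heapq
from typing import Dict, List, Tuple

Scored = Tuple[float, str]  # (score, answer id); hypothetical representation


def local_top_k(scores: Dict[str, float], k: int) -> List[Scored]:
    """Rank one peer's answers locally and keep only its k best."""
    return heapq.nlargest(k, ((s, a) for a, s in scores.items()))


def merge_top_k(ranked_lists: List[List[Scored]], k: int) -> List[Scored]:
    """Merge the local top-k lists of several peers into one top-k list."""
    merged = (item for ranked in ranked_lists for item in ranked)
    return heapq.nlargest(k, merged)


# Hypothetical peers and scores (assumed to be comparable across peers).
peers = {
    "peer_a": {"doc1": 0.9, "doc2": 0.4, "doc3": 0.1},
    "peer_b": {"doc4": 0.8, "doc5": 0.7},
    "peer_c": {"doc6": 0.2},
}
k = 2

# Each peer sends at most k answers, which bounds the traffic per peer.
local = {name: local_top_k(scores, k) for name, scores in peers.items()}
result = merge_top_k(list(local.values()), k)

# A simple query statistic: which peers contributed to the final top-k.
# A router could prefer these peers for similar queries (an assumption).
contributors = {
    name for name, ranked in local.items()
    if any(item in result for item in ranked)
}

print(result)
print(sorted(contributors))
```

Because each peer returns at most k answers, the merging peer never handles more than k answers per neighbor, which is one plausible way to keep both the answer set and the network traffic small. The `contributors` set only hints at how dynamically collected query statistics might guide routing; the actual peer-ranking scheme is not described in the abstract.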
Similar resources
Semantic Query Routing and Distributed Top-k Query Processing in Peer-to-Peer Networks
Requirements for widely distributed information systems supporting virtual organizations have given rise to a new category of peer-to-peer (p2p) systems called schema-based. In such systems each peer is a database management system in itself, exposing its own schema. In such a setting, a main objective is the efficient search across peer databases by processing each incoming query without overl...
Semantic-Driven K-Walkers-Based Search in Unstructured Peer-to-Peer Networks
In this paper, we present a semantic-driven k-walkers-based search scheme, called S2P2P, for data information dissemination and query routing in unstructured peer-to-peer (P2P) networks. In S2P2P, each peer maintains its observation on the semantics of received queries (demands) and data information (supplies), as well as a local view on network topology. On top of this, each peer, in line with ...
Distributed RDF Query Processing and Reasoning in Peer-to-Peer Networks
With the interest in Semantic Web applications rising rapidly, the Resource Description Framework (RDF) and its accompanying vocabulary description language, RDF Schema (RDFS), have become one of the most widely used data models for representing and integrating structured information in the Web. RDF provides a simple and abstract knowledge representation for resources on the Web, while RDFS def...
Unified Framework for Top-k Query Processing in Peer-to-Peer Networks
Supporting queries over dispersed data stored in large-scale distributed systems, such as peer-to-peer networks, naturally calls for ranked retrieval in order to effectively focus on the most relevant (i.e., top-k) results. While top-k retrieval has been actively studied lately, existing algorithms are too restrictive due to their assumptions about how the data is partitioned amongst the variou...
A Detailed Evaluation of Threshold Algorithms for Answering Top-k Queries in Peer-to-Peer Networks
Ranking queries, also known as top-k queries, have drawn considerable attention due to their usability in various applications. Several algorithms have been proposed for the evaluation of top-k queries. A large percentage of them follow the Threshold Approach. In p2p networks, top-k query processing can provide a lot of advantages both in time and bandwidth consumption. We focus on the main ada...